PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG016864t1
Common NameTCM_016864
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 277aa    MW: 31554.9 Da    PI: 4.4394
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG016864t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox64.12e-2065118356
                       --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   3 kRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       k++++t+eq++ Le+ Fe ++++  e++++LAkklgL+ rqV vWFqNrRa++k
  Thecc1EG016864t1  65 KKRRLTAEQVHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRARWK 118
                       56789************************************************9 PP

2HD-ZIP_I/II134.14.9e-4364156193
       HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreelke 93 
                       ekkrrl++eqv+lLE+sFe+e+kLeperK++la++LglqprqvavWFqnrRAR+ktkqlE+dy+ Lk++yd+l+++ + + +e+e+L++e+++
  Thecc1EG016864t1  64 EKKRRLTAEQVHLLEKSFETENKLEPERKTQLAKKLGLQPRQVAVWFQNRRARWKTKQLERDYDLLKSSYDSLVSNYDCIVQENEKLKSEVAS 156
                       69**************************************************************************************99875 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466895.56E-2049122IPR009057Homeodomain-like
PROSITE profilePS5007118.0160120IPR001356Homeobox domain
SMARTSM003898.9E-2063124IPR001356Homeobox domain
CDDcd000862.45E-1865121No hitNo description
PfamPF000461.4E-1765118IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.603.2E-2266127IPR009057Homeodomain-like
PRINTSPR000314.1E-691100IPR000047Helix-turn-helix motif
PROSITE patternPS00027095118IPR017970Homeobox, conserved site
PRINTSPR000314.1E-6100116IPR000047Helix-turn-helix motif
PfamPF021836.6E-18120162IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0001558Biological Processregulation of cell growth
GO:0009637Biological Processresponse to blue light
GO:0009651Biological Processresponse to salt stress
GO:0009965Biological Processleaf morphogenesis
GO:0045893Biological Processpositive regulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0042803Molecular Functionprotein homodimerization activity
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 277 aa     Download sequence    Send to blast
MESGRLFFNS STCHGNMLFL GNCDPVFRGA RTMISMEETS KRRPFFSSPE DMYDEEYYDE  60
QLPEKKRRLT AEQVHLLEKS FETENKLEPE RKTQLAKKLG LQPRQVAVWF QNRRARWKTK  120
QLERDYDLLK SSYDSLVSNY DCIVQENEKL KSEVASLTEK LQAKDATTEP VIGQKPEPLP  180
ADIVSSLQFS VKVEDRQSTG SAGSAVVDED APQLLDSGDS YFPSDEYPGG CVGPVNRLQS  240
EEDDGSDDGR SYFSNVFTAT EEQQQHEESL GWWVWS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1112120RRARWKTKQ
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00323DAPTransfer from AT3G01470Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007031433.10.0Homeobox-leucine zipper protein HAT5
SwissprotQ022831e-114HAT5_ARATH; Homeobox-leucine zipper protein HAT5
TrEMBLA0A061EJ860.0A0A061EJ86_THECC; Homeobox-leucine zipper protein HAT5
STRINGVIT_14s0066g01440.t011e-136(Vitis vinifera)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM88842738
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G01470.11e-113homeobox 1
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]